An optimal algorithm and extensions for the MoJo distance measure
Author
Abstract
The software industry frequently faces the problem of maintaining and improving legacy software systems. Although most legacy systems still work well, their structure is often no longer understood. Before migrating such a system to a new operating system or a different programming language, or extending its functionality, it is essential to recover its structure. Many reverse engineering projects attempt to regain this knowledge. A common approach to understanding a large software system is to decompose it into smaller subsystems that are easier to comprehend. Although such approaches can aid the understanding of legacy systems, current software clustering techniques are hard to evaluate, so an objective way of comparing different software clustering decompositions is necessary. In this thesis, we concentrated on comparing software clustering techniques by comparing their output decompositions. We improved MoJo, a method for comparing the output of different software clustering approaches, and enhanced a metric for evaluating the quality of a software clustering approach. We also introduced a new variation of MoJo that integrates edge information into the measure. The approaches presented in this thesis have been implemented and applied to real industrial software systems. The results demonstrate the effectiveness and usefulness of our techniques.
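The abstract does not define MoJo. The sketch below assumes the usual definition: the minimum number of Move operations (relocating one entity to another cluster) and Join operations (merging two clusters) needed to turn decomposition A into decomposition B. The function name `mojo_distance`, the dict-based input format, and the matching-based strategy are illustrative assumptions and may differ from the algorithm developed in the thesis.

```python
from collections import Counter, defaultdict


def mojo_distance(a, b):
    """Hypothetical sketch: count Move + Join operations turning `a` into `b`.

    `a` and `b` map each entity name to a cluster label and must cover
    the same set of entities.
    """
    if a.keys() != b.keys():
        raise ValueError("both decompositions must cover the same entities")

    # overlap[ca][cb] = number of entities shared by cluster ca of `a`
    # and cluster cb of `b`.
    overlap = defaultdict(Counter)
    for entity, cluster_a in a.items():
        overlap[cluster_a][b[entity]] += 1

    # Tag each cluster of `a` with the cluster(s) of `b` it overlaps most.
    # Every entity outside that best overlap needs one Move.
    tags = {}
    moves = 0
    for cluster_a, row in overlap.items():
        best = max(row.values())
        tags[cluster_a] = [cb for cb, n in row.items() if n == best]
        moves += sum(row.values()) - best

    # Maximum bipartite matching (simple augmenting paths) between clusters
    # of `a` and their tags. Each matched pair is a group that needs no Join.
    matched = {}  # cluster of `b` -> cluster of `a`

    def try_assign(cluster_a, seen):
        for cb in tags[cluster_a]:
            if cb in seen:
                continue
            seen.add(cb)
            if cb not in matched or try_assign(matched[cb], seen):
                matched[cb] = cluster_a
                return True
        return False

    groups = sum(try_assign(ca, set()) for ca in tags)
    joins = len(tags) - groups  # unmatched clusters are joined into a group
    return moves + joins


# Toy decompositions of four files (labels are made up for illustration).
split = {"a.c": "X", "b.c": "X", "c.c": "Y", "d.c": "Y"}
merged = {"a.c": "P", "b.c": "P", "c.c": "P", "d.c": "P"}
```

Under these assumptions the measure is not symmetric: `mojo_distance(split, merged)` needs a single Join, while `mojo_distance(merged, split)` needs two Moves.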
Similar resources
Degree of Optimality as a Measure of Distance of Power System Operation from Optimal Operation
This paper presents an algorithm based on inter-solutions of having scheduled electricity generation resources and the fuzzy logic as a sublimation tool of outcomes obtained from the schedule inter-solutions. The goal of the algorithm is to bridge the conflicts between minimal cost and other aspects of generation. In the past, the optimal scheduling of electricity generation resources has been ...
A partition-based algorithm for clustering large-scale software systems
Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawba...
Optimization with the time-dependent Navier-Stokes equations as constraints
In this paper, optimal distributed control of the time-dependent Navier-Stokes equations is considered. The control problem involves the minimization of a measure of the distance between the velocity field and a given target velocity field. A mixed numerical method involving a quasi-Newton algorithm, a novel calculation of the gradients and an inhomogeneous Navier-Stokes solver, to find the opt...
A Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach
In recent years, the advancement of information gathering technologies such as GPS and GSM networks have led to huge complex datasets such as time series and trajectories. As a result it is essential to use appropriate methods to analyze the produced large raw datasets. Extracting useful information from large data sets has always been one of the most important challenges in different sciences,...
A New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation
Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...